home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
Skunkware 5
/
Skunkware 5.iso
/
src
/
Tools
/
glimpsehttp
/
README
< prev
Wrap
Text File
|
1995-05-16
|
5KB
|
107 lines
NAME
GlimpseHTTP
WHAT IS GLIMPSE
Glimpse (which stands for GLobal IMPlicit SEarch) is an indexing and
query system that allows you to search through lots of files in many
(possibly nested) directories very quickly.
Glimpseindex, which you run by saying glimpseindex <directory(ies)>
builds a very small index (2-5% of the text).
With it, glimpse can search through all the files in these directories
much the same way as grep, except that you don't have to specify file
names. Glimpse supports most of agrep's options (agrep is our
powerful version of grep, and it is part of glimpse) including
approximate matching (e.g., finding misspelled words), Boolean queries,
and even some limited forms of regular expressions.
DESCRIPTION
GlimpseHTTP is a collection of tools that allows you to incorporate
glimpse in WWW documents. With it, you can provide general
search capabilities to any user without incurring too much space
overhead. Furthermore, these tools allow you to integrate search with
browsing. If you have several nested directories which the user may
browse, you can include the glimpse interface in each document such that
only the relevant directories will be included in the search. More
details are given below.
The current version of GlimpseHTTP was
tested under httpd 1.2 HTML server from NCSA and
Glimpse currently works on many Unix platforms.
To search and browse the information any HTML browser can be used
(this includes NCSA Mosaic for X-Windows, MS-Windows and
Macintosh, Lynx and other browsers. For maximum convenience
your browser should support forms, although minimal
functionality can be achieved with any browser).
Since GlimpseHTTP uses Glimpse, this provides some unique features
- A very small index (3-5% of the total text).
- Reasonably fast search.
- Search for approximate match allowing errors.
In addition, GlimpseHTTP provides you with the following
capabilities:
- You can use a combination of browsing and searching:
first, you locate the directory where the relevant
information can be stored, then you can use search
to locate specific files.
- The result of the search is a nicely formatted hypertext with
hyperlinks to matching documents.
- Following the hyperlink leads you not only to a particular
file, but also to the exact place where the match occured.
- Hyperlinks in the documents are converted on the fly to
actual hyperlinks, which you can follow immediately. This
makes the GlimpseHTTP particularily suitable for searching
meta-information (Internet directories etc.).
- Similar tools are provided for archiving and searching
USENET newsgroups. You can maintain the archive of news articles
and allow people to search your archive using the
same interface. Features supported include kill-file for articles
and fast search for particular posters. Since news archiver uses
NNTP interface, you can archive news articles from remote
news servers. (Browse and search for news is yet to be
implemented: browsing in this case means selection of pertinent
newsgroup(s), currently supported is only the search within
one newsgroup a time)
Among the possible applications of GlimpseHTTP we envision:
- FTP sites with search possibilities;
- news archiving sites;
- any search application which should be accessed over local
or global network where searching for approximate match and/or
saving of disk space for indices is an issue.
GlimpseHTTP components:
1. aglimpse - "Archive Glimpse" - a tool for searching file
hierarchies indexed for Glimpse. aglimpse is a CGI-compliant
program which performs the search and formats the output as
HTML document with hyperlinks to the matches.
2. Administrative tools which facilitate maintaining and
indexing of Glimpse archives. One of the programs is the
HTML indexer which prepares hypertext indices for
each searchable directory - this supports the concept
of combined browsing and searching.
3. GlimpseNews - a collection of tools for archiving and
searching newsgroups archives.
SEE ALSO
http://glimpse.cs.arizona.edu:1994/glimpsehttp.html -
GlimpseHTTP home page.
http://glimpse.cs.arizona.edu:1994 - Glimpse
developers home page.
README.install - directions on installing GlimpseHTTP
on your server.
README.amgr - description of Archive Manager.
README.indexing - descriptioN of HTML indexer.
AUTHORS
Paul Klark (GlimpseHTTP)
Udi Manber, Sun Wu, and Burra Gopal (Glimpse)
University of Arizona, Department of Computer Science
To be put on glimpse mailing list, send mail to
glimpse-request@cs.arizona.edu